Overview

Dataset Statistics

Number of Variables 31
Number of Rows 3023
Missing Cells 12197
Missing Cells (%) 13.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 4.7 MB
Average Row Size in Memory 1.6 KB
Variable Types
  • Numerical: 4
  • Categorical: 25
  • GeoGraphy: 1
  • GeoPoint: 1
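
The headline figures above can be cross-checked with simple arithmetic. The sketch below uses only values reported in this overview. It assumes that Missing Cells (%) is the missing-cell count divided by rows times variables, and that the size figures use binary units (1 MB = 1024 KB); both are assumptions about how the report computes them, not something the report states.

```python
# Figures copied from the Dataset Statistics block.
n_variables = 31
n_rows = 3023
missing_cells = 12197
total_size_mb = 4.7

# Assumption: the percentage is taken over all cells (rows x variables).
total_cells = n_rows * n_variables
missing_pct = 100 * missing_cells / total_cells

# Assumption: binary units, 1 MB = 1024 KB.
avg_row_kb = total_size_mb * 1024 / n_rows

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average row size: {avg_row_kb:.1f} KB")
```

Under these assumptions the computed values round to the reported 13.0% and 1.6 KB.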

Dataset Insights

Age has 121 (4.0%) missing values
Primary Fur Color has 55 (1.82%) missing values
Highlight Fur Color has 1086 (35.92%) missing values
Color notes has 2841 (93.98%) missing values
Location has 64 (2.12%) missing values
Above Ground Sighter Measurement has 114 (3.77%) missing values
Specific Location has 2547 (84.25%) missing values
Other Activities has 2586 (85.54%) missing values
Other Interactions has 2783 (92.06%) missing values
Hectare Squirrel Number is skewed
Unique Squirrel ID has a high cardinality: 3018 distinct values
Hectare has a high cardinality: 339 distinct values
Color notes has a high cardinality: 135 distinct values
Specific Location has a high cardinality: 304 distinct values
Other Activities has a high cardinality: 307 distinct values
Other Interactions has a high cardinality: 197 distinct values
Lat/Long has a high cardinality: 3023 distinct values
Hectare has constant length 3
Shift has constant length 2
Location has constant length 12
Lat/Long has all distinct values
X has 3023 (100.0%) negatives
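
The per-column missing counts listed in the insights above can be summed to confirm that they account for the dataset-wide Missing Cells figure. This is only a consistency check on the reported numbers; the dictionary name is illustrative.

```python
# Missing-value counts copied from the Dataset Insights list.
missing_by_column = {
    "Age": 121,
    "Primary Fur Color": 55,
    "Highlight Fur Color": 1086,
    "Color notes": 2841,
    "Location": 64,
    "Above Ground Sighter Measurement": 114,
    "Specific Location": 2547,
    "Other Activities": 2586,
    "Other Interactions": 2783,
}

total_missing = sum(missing_by_column.values())
print(total_missing)  # matches the reported Missing Cells total of 12197
```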

Variables


X

numerical

Approximate Distinct Count 3023
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 48368
Mean -73.9672
Minimum -73.9812
Maximum -73.9497
Zeros 0
Zeros (%) 0.0%
Negatives 3023
Negatives (%) 100.0%
  • X is skewed right (γ1 = 0.2253)

Quantile Statistics

Minimum -73.9812
5-th Percentile -73.9788
Q1 -73.9731
Median -73.9686
Q3 -73.9602
95-th Percentile -73.9543
Maximum -73.9497
Range 0.03144
IQR 0.01291

Descriptive Statistics

Mean -73.9672
Standard Deviation 0.007726
Variance 5.9696e-05
Sum -223602.7968
Skewness 0.2253
Kurtosis -1.0165
Coefficient of Variation -0.00010446
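
Several of the statistics for X are derived from others in the same block. The sketch below recomputes them from the reported (already rounded) values, assuming the usual definitions: range = max - min, IQR = Q3 - Q1, variance = standard deviation squared, and coefficient of variation = standard deviation / mean. Because the inputs are rounded, only approximate agreement is expected.

```python
import math

# Values copied from the X variable's statistics (rounded by the report).
minimum, maximum = -73.9812, -73.9497
q1, q3 = -73.9731, -73.9602
mean, std = -73.9672, 0.007726

value_range = maximum - minimum    # reported: 0.03144
iqr = q3 - q1                      # reported: 0.01291
variance = std ** 2                # reported: 5.9696e-05
coeff_of_variation = std / mean    # reported: -0.00010446

# Compare against the reported figures with tolerances that allow for rounding.
print(math.isclose(value_range, 0.03144, abs_tol=1e-4))
print(math.isclose(iqr, 0.01291, abs_tol=1e-4))
print(math.isclose(variance, 5.9696e-05, rel_tol=1e-3))
print(math.isclose(coeff_of_variation, -0.00010446, rel_tol=1e-3))
```

The negative coefficient of variation simply reflects the negative mean of the X values; its magnitude is what indicates how tightly the values cluster.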

Y

numerical

Approximate Distinct Count 3023
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 48368
Mean 40.7809
Minimum 40.7649
Maximum 40.8001
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Y is skewed right (γ1 = 0.3067)

Quantile Statistics

Minimum 40.7649
5-th Percentile 40.7674
Q1 40.7717
Median 40.7782
Q3 40.7912
95-th Percentile 40.7976
Maximum 40.8001
Range 0.03521
IQR 0.01954

Descriptive Statistics

Mean 40.7809
Standard Deviation 0.01029
Variance 0.00010579
Sum 123280.5186
Skewness 0.3067
Kurtosis -1.3239
Coefficient of Variation 0.00025221

Unique Squirrel ID

categorical

Approximate Distinct Count 3018
Approximate Unique (%) 99.8%
Missing 0
Missing (%) 0.0%
Memory Size 237888

Length

Mean 13.6927
Standard Deviation 0.4615
Median 14
Minimum 13
Maximum 14

Sample

1st row 37F-PM-1014-03
2nd row 21B-AM-1019-04
3rd row 11B-PM-1014-08
4th row 32E-PM-1017-14
5th row 13E-AM-1017-05

Letter

Count 9069
Lowercase Letter 0
Space Separator 0
Uppercase Letter 9069
Dash Punctuation 9069
Decimal Number 23255

Hectare

categorical

Approximate Distinct Count 339
Approximate Unique (%) 11.2%
Missing 0
Missing (%) 0.0%
Memory Size 205564

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 37F
2nd row 21B
3rd row 11B
4th row 32E
5th row 13E

Letter

Count 3023
Lowercase Letter 0
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 6046
  • Hectare has words of constant length

Shift

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 202541

Length

Mean 2
Standard Deviation 0
Median 2
Minimum 2
Maximum 2

Sample

1st row PM
2nd row AM
3rd row PM
4th row PM
5th row AM

Letter

Count 6046
Lowercase Letter 0
Space Separator 0
Uppercase Letter 6046
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (PM, AM) take over 50.0%
  • Shift has words of constant length

Date

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 48368
Mean 1.0119e+07
Minimum 10062018
Maximum 10202018
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Date is skewed right (γ1 = 0.2557)

Quantile Statistics

Minimum 10062018
5-th Percentile 1.0062e+07
Q1 1.0072e+07
Median 1.0122e+07
Q3 1.0142e+07
95-th Percentile 1.0192e+07
Maximum 10202018
Range 140000
IQR 70000

Descriptive Statistics

Mean 1.0119e+07
Standard Deviation 42466.715
Variance 1.8034e+09
Sum 3.0591e+10
Skewness 0.2557
Kurtosis -1.1055
Coefficient of Variation 0.004197
  • Date is not normally distributed (p-value 4.058666901237434e-06)
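
The Date column is profiled as a number, so statistics such as its mean, variance, and skewness are not meaningful as they stand. The values look like dates written as MMDDYYYY integers (for example 10062018); under that assumption, a minimal sketch for converting them to real dates is shown below. The function name `parse_census_date` is hypothetical.

```python
from datetime import date, datetime


def parse_census_date(value: int) -> date:
    """Convert an MMDDYYYY integer such as 10062018 into a date."""
    # Zero-pad to eight digits so a single-digit month would still parse.
    return datetime.strptime(f"{value:08d}", "%m%d%Y").date()


first = parse_census_date(10062018)  # reported minimum
last = parse_census_date(10202018)   # reported maximum
print(first, last, (last - first).days)
```

Read this way, the reported numeric range of 140000 corresponds to a span of 14 days.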

Hectare Squirrel Number

numerical

Approximate Distinct Count 23
Approximate Unique (%) 0.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 48368
Mean 4.1237
Minimum 1
Maximum 23
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Hectare Squirrel Number is skewed right (γ1 = 1.4724)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 2
Median 3
Q3 6
95-th Percentile 10
Maximum 23
Range 22
IQR 4

Descriptive Statistics

Mean 4.1237
Standard Deviation 3.0965
Variance 9.5883
Sum 12466
Skewness 1.4724
Kurtosis 2.7897
Coefficient of Variation 0.7509
  • Hectare Squirrel Number is not normally distributed (p-value 1.726606135782556e-10)
  • Hectare Squirrel Number has 67 outliers
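
The report does not say how the 67 outliers were identified. A common convention is the 1.5 x IQR rule, and the sketch below applies it to the reported quartiles purely as an illustration of that rule; it is an assumption that the report uses the same threshold.

```python
# Quartiles copied from the Quantile Statistics block.
q1 = 2
q3 = 6

iqr = q3 - q1

# Assumption: Tukey's 1.5 x IQR fences.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

print(lower_fence, upper_fence)
```

Since the minimum is 1, only the upper fence matters here: under this rule, values above 12 (up to the reported maximum of 23) would be flagged.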

Age

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 121
Missing (%) 4.0%
Memory Size 204114
  • The largest value (Adult) is over 7.78 times larger than the second largest value (Juvenile)

Length

Mean 5.3356
Standard Deviation 0.966
Median 5
Minimum 1
Maximum 8

Sample

1st row Adult
2nd row Adult
3rd row Adult
4th row Adult
5th row Adult

Letter

Count 15480
Lowercase Letter 12582
Space Separator 0
Uppercase Letter 2898
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Adult, Juvenile) take over 50.0%
  • The largest value (adult) is over 7.78 times larger than the second largest value (juvenile)

Primary Fur Color

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 55
Missing (%) 1.8%
Memory Size 206463
  • The largest value (Gray) is over 6.31 times larger than the second largest value (Cinnamon)

Length

Mean 4.563
Standard Deviation 1.3533
Median 4
Minimum 4
Maximum 8

Sample

1st row Gray
2nd row Gray
3rd row Gray
4th row Cinnamon
5th row Gray

Letter

Count 13543
Lowercase Letter 10575
Space Separator 0
Uppercase Letter 2968
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Gray, Cinnamon) take over 50.0%
  • The largest value (gray) is over 6.31 times larger than the second largest value (cinnamon)

Highlight Fur Color

categorical

Approximate Distinct Count 10
Approximate Unique (%) 0.5%
Missing 1086
Missing (%) 35.9%
Memory Size 141477

Length

Mean 8.0392
Standard Deviation 3.8604
Median 8
Minimum 4
Maximum 22

Sample

1st row Cinnamon
2nd row White
3rd row Cinnamon
4th row White
5th row Cinnamon

Letter

Count 14746
Lowercase Letter 12396
Space Separator 413
Uppercase Letter 2350
Dash Punctuation 0
Decimal Number 0

Combination of Primary and Highlight Color

categorical

Approximate Distinct Count 22
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Memory Size 228633

Length

Mean 10.6312
Standard Deviation 5.173
Median 10
Minimum 1
Maximum 27

Sample

1st row +
2nd row +
3rd row Gray+
4th row Gray+
5th row Gray+Cinnamon

Letter

Count 28289
Lowercase Letter 22971
Space Separator 413
Uppercase Letter 5318
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Gray+, Gray+Cinnamon) take over 50.0%

Color notes

categorical

Approximate Distinct Count 135
Approximate Unique (%) 74.2%
Missing 2841
Missing (%) 94.0%
Memory Size 19633
  • The largest value (Gray & Cinnamon selected as Primary. White selected as Highlights. Made executive adjustments.) is over 1.8 times larger than the second largest value (white belly)

Length

Mean 42.8736
Standard Deviation 37.2651
Median 23
Minimum 3
Maximum 153

Sample

1st row Nothing selected a...
2nd row just outside hecta...
3rd row Gray & White selec...
4th row Cinnamon stripe on...
5th row Nothing selected a...

Letter

Count 6423
Lowercase Letter 5957
Space Separator 1037
Uppercase Letter 466
Dash Punctuation 9
Decimal Number 13

Location

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 64
Missing (%) 2.1%
Memory Size 227843
  • The largest value (Ground Plane) is over 2.51 times larger than the second largest value (Above Ground)

Length

Mean 12
Standard Deviation 0
Median 12
Minimum 12
Maximum 12

Sample

1st row Above Ground
2nd row Above Ground
3rd row Ground Plane
4th row Ground Plane
5th row Ground Plane

Letter

Count 32549
Lowercase Letter 26631
Space Separator 2959
Uppercase Letter 5918
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Ground Plane, Above Ground) take over 50.0%
  • Location has words of constant length

Above Ground Sighter Measurement

categorical

Approximate Distinct Count 41
Approximate Unique (%) 1.4%
Missing 114
Missing (%) 3.8%
Memory Size 200941
  • The largest value (FALSE) is over 18.24 times larger than the second largest value (10)

Length

Mean 4.0756
Standard Deviation 1.5329
Median 5
Minimum 1
Maximum 5

Sample

1st row 10
2nd row FALSE
3rd row FALSE
4th row FALSE
5th row 30

Letter

Count 10580
Lowercase Letter 0
Space Separator 0
Uppercase Letter 10580
Dash Punctuation 0
Decimal Number 1276
  • The top 2 categories (FALSE, 10) take over 50.0%
  • The largest value (false) is over 18.24 times larger than the second largest value (10)

Specific Location

categorical

Approximate Distinct Count 304
Approximate Unique (%) 63.9%
Missing 2547
Missing (%) 84.2%
Memory Size 38158

Length

Mean 15.1639
Standard Deviation 11.8227
Median 12
Minimum 4
Maximum 102

Sample

1st row on tree stump
2nd row on tree roots
3rd row under a tree
4th row in b/w trees
5th row tree

Letter

Count 6014
Lowercase Letter 5680
Space Separator 1038
Uppercase Letter 334
Dash Punctuation 5
Decimal Number 49
  • The largest value (tree) is over 6.47 times larger than the second largest value (fence)

Running

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 210880
  • The largest value (False) is over 3.14 times larger than the second largest value (True)

Length

Mean 4.7585
Standard Deviation 0.4281
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row False
5th row False

Letter

Count 14385
Lowercase Letter 11362
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 3.14 times larger than the second largest value (true)
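
The report gives the False-to-True ratio but not the counts themselves. For a column with no missing values that holds only the strings False (5 letters) and True (4 letters), the counts can be recovered from the Letter block. The sketch below does this for Running; the same arithmetic applies to the other True/False columns.

```python
n_rows = 3023
total_letters = 14385  # "Count" in the Letter block for Running

# total_letters = 5 * n_false + 4 * n_true, with n_false + n_true = n_rows,
# so n_true = 5 * n_rows - total_letters.
n_true = 5 * n_rows - total_letters
n_false = n_rows - n_true
ratio = n_false / n_true

print(n_true, n_false, round(ratio, 2))  # 730 2293 3.14
```

The recovered ratio agrees with the reported 3.14.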

Chasing

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 211331
  • The largest value (False) is over 9.84 times larger than the second largest value (True)

Length

Mean 4.9077
Standard Deviation 0.2895
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row True
4th row False
5th row False

Letter

Count 14836
Lowercase Letter 11813
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 9.84 times larger than the second largest value (true)

Climbing

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 210952
  • The largest value (False) is over 3.59 times larger than the second largest value (True)

Length

Mean 4.7823
Standard Deviation 0.4127
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row False
5th row False

Letter

Count 14457
Lowercase Letter 11434
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 3.59 times larger than the second largest value (true)

Eating

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 210850
  • The largest value (False) is over 2.98 times larger than the second largest value (True)

Length

Mean 4.7486
Standard Deviation 0.4339
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row True
5th row False

Letter

Count 14355
Lowercase Letter 11332
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 2.98 times larger than the second largest value (true)

Foraging

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 210175

Length

Mean 4.5253
Standard Deviation 0.4994
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row True
5th row True

Letter

Count 13680
Lowercase Letter 10657
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%

Other Activities

categorical

Approximate Distinct Count 307
Approximate Unique (%) 70.2%
Missing 2586
Missing (%) 85.5%
Memory Size 36346

Length

Mean 17.4622
Standard Deviation 13.8434
Median 13
Minimum 4
Maximum 132

Sample

1st row grooming
2nd row walking
3rd row moving slowly
4th row sitting
5th row eating (ate upside...

Letter

Count 6337
Lowercase Letter 6337
Space Separator 859
Uppercase Letter 0
Dash Punctuation 15
Decimal Number 100

Kuks

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 211508
  • The largest value (False) is over 28.64 times larger than the second largest value (True)

Length

Mean 4.9663
Standard Deviation 0.1806
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row False
5th row False

Letter

Count 15013
Lowercase Letter 11990
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 28.64 times larger than the second largest value (true)

Quaas

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 211560
  • The largest value (False) is over 59.46 times larger than the second largest value (True)

Length

Mean 4.9835
Standard Deviation 0.1276
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row False
5th row False

Letter

Count 15065
Lowercase Letter 12042
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 59.46 times larger than the second largest value (true)

Moans

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 211607
  • The largest value (False) is over 1006.67 times larger than the second largest value (True)

Length

Mean 4.999
Standard Deviation 0.03149
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row False
5th row False

Letter

Count 15112
Lowercase Letter 12089
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 1006.67 times larger than the second largest value (true)

Tail flags

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 211455
  • The largest value (False) is over 18.5 times larger than the second largest value (True)

Length

Mean 4.9487
Standard Deviation 0.2206
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row False
5th row False

Letter

Count 14960
Lowercase Letter 11937
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 18.5 times larger than the second largest value (true)

Tail twitches

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 211176
  • The largest value (False) is over 5.97 times larger than the second largest value (True)

Length

Mean 4.8564
Standard Deviation 0.3507
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row False
5th row False

Letter

Count 14681
Lowercase Letter 11658
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 5.97 times larger than the second largest value (true)

Approaches

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 211432
  • The largest value (False) is over 15.98 times larger than the second largest value (True)

Length

Mean 4.9411
Standard Deviation 0.2354
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row False
5th row False

Letter

Count 14937
Lowercase Letter 11914
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 15.98 times larger than the second largest value (true)

Indifferent

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 210156

Length

Mean 4.519
Standard Deviation 0.4997
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row False
5th row False

Letter

Count 13661
Lowercase Letter 10638
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%

Runs from

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 210932
  • The largest value (False) is over 3.46 times larger than the second largest value (True)

Length

Mean 4.7757
Standard Deviation 0.4172
Median 5
Minimum 4
Maximum 5

Sample

1st row False
2nd row False
3rd row False
4th row True
5th row False

Letter

Count 14437
Lowercase Letter 11414
Space Separator 0
Uppercase Letter 3023
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (False, True) take over 50.0%
  • The largest value (false) is over 3.46 times larger than the second largest value (true)

Other Interactions

categorical

Approximate Distinct Count 197
Approximate Unique (%) 82.1%
Missing 2783
Missing (%) 92.1%
Memory Size 22180

Length

Mean 24.4833
Standard Deviation 17.4389
Median 21.5
Minimum 2
Maximum 106

Sample

1st row fenced off area ca...
2nd row gnd to tree
3rd row ran from dog-walke...
4th row me
5th row dog chased

Letter

Count 4797
Lowercase Letter 4797
Space Separator 832
Uppercase Letter 0
Dash Punctuation 8
Decimal Number 10

Lat/Long

categorical

Approximate Distinct Count 3023
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Memory Size 323178

Length

Mean 41.9064
Standard Deviation 0.6595
Median 42
Minimum 38
Maximum 45

Sample

1st row POINT (-73.9561344...
2nd row POINT (-73.9688574...
3rd row POINT (-73.9742811...
4th row POINT (-73.9596413...
5th row POINT (-73.9702676...

Letter

Count 15115
Lowercase Letter 0
Space Separator 6046
Uppercase Letter 15115
Dash Punctuation 3023
Decimal Number 90407
  • The largest value (point) is over 3023.0 times larger than the second largest value (407649106677138)
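
The Lat/Long values are text in a `POINT (x y)` form, so the word-frequency insight above is an artifact of tokenizing coordinates rather than a real finding. A minimal sketch for extracting the two numbers is shown below. The function name `parse_point` is hypothetical, and the input string is made up for illustration because the report truncates the real samples. The first number is treated as longitude, which is consistent with the all-negative X column.

```python
import re
from typing import Tuple

# Matches "POINT (<number> <number>)" with optional sign and decimals.
_POINT_RE = re.compile(r"^POINT \((-?\d+(?:\.\d+)?) (-?\d+(?:\.\d+)?)\)$")


def parse_point(wkt: str) -> Tuple[float, float]:
    """Return (x, y) from a 'POINT (x y)' string."""
    match = _POINT_RE.match(wkt.strip())
    if match is None:
        raise ValueError(f"not a POINT literal: {wkt!r}")
    x, y = match.groups()
    return float(x), float(y)


# Hypothetical value, not taken from the dataset.
lon, lat = parse_point("POINT (-73.97 40.78)")
print(lon, lat)
```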

Interactions

Correlations

Missing Values